Drafting Visualizations
1. Which option do you plan to pursue?
I plan to pursue option 1
2. Restate your questions. Has this changed at all since HW #1? If yes, how so?
Is there a gender gap in academic publishing and if so, what does it look like?
- What is the gender gap across different academic disciplines?
- How has the gender gap changed over time?
- What does the gender gap look like from country to country?
I think these questions have changed slightly. I was originally interested in understanding how the gender publishing gap affects women’s career mobility and opportunities overall. That, however, is a much more complicated question that I do not have the data for. So, I’ve decided to go with a simpler approach.
3. Explain which variables from your data set you will use to answer your questions, and how.
The variables I will use to answer this question are as follows:
authors: int; represents the total number of authored publications for a specific country, gender, time period, and field.country: chr; the country of origin of the publicationsgender: chr; the gender of the publishersubject_area_or_subfield: chr; the academic discipline the paper was published fromperiod: fct; represents one of two time periods (1993-2003) or (2014-2018). There is no information on publications between or after these time periods.geometry: vec; country boundaries so I can map gender differences across countries
All this information comes from one dataset about gender and academic publishing created by Elsevier with the exception of the geometries, that come from a world administrative boundaries shape file.
Most of the variables I can use as-is in my visualizations to answer my questions. There is some wrangling that has to happen, and it mostly involves generating summary statistics of publications by gender and country, or gender and academic discipline.
4. Find at least two data visualizations that you could borrow/adapt pieces from and explain which elements you might borrow.
I like this data viz as a cool way I might show the gender gap amongst different academic disciplines. Currently, I’m using a bubble chart with the percentage of total publications by gender. I like this viz a lot because it really centers the industries themselves, rather than the numbers. I think this would work nicely with the data I have, however, I am also intimidated about trying to accomplish this.
If I wanted to switch up my variables and display gender gap by country with a dumbbell chart, it could look something like this. I like how the values along the x-axis are also located inside the bubbles themselves, that way you don’t have to work to hard to see what they are. I also like the gender color choices here.